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ABSTRACT 

The formation of the open promoter complex (RPo) 
in which the melted DNA containing the transcrip- 
tion start site is located at the RNA polymerase 
(RNAP) catalytic centre is an obligatory step in the 
transcription of DNA into RNA catalyzed by RNAP. In 
the RPo, an extensive network of interactions is 
established between DNA, RNAP and the a-factor 
and the formation of functional RPo occurs via a 
series of transcriptional intermediates (collectively 
'RPi'). A single tryptophan is ideally positioned to 
directly engage with the flipped out base of the 
non-template strand at the +1 site. Evidence 
suggests that this tryptophan (i) is involved in 
either forward translocation or DNA scrunching 
and (ii) in <r 54 -regulated promoters limits the tran- 
scription activity of at least one intermediate 
complex (RPi) before the formation of a fully func- 
tional RPo. Limiting RPi activity may be important in 
preventing the premature synthesis of abortive tran- 
scripts, suggesting its involvement in a general 
mechanism driving the RPi to RPo transition for 
transcription initiation. 

INTRODUCTION 

Gene expression is controlled and regulated at many 
stages, e.g. epigenetically, transcriptionally, translation- 
ally and through post-translational modifications. 
Transcription initiation is particularly amenable to regu- 
lation owing to the large number of binding or catalytic 
reactions occurring and owing to the complexity of the 
molecular interfaces existing between the many factors 
involved. Fundamentally, the mechanics and chemistry 
of transcription are conserved across the three domains 



of life and are set by key functional properties of the 
RNA polymerases (RNAPs) and their associated tran- 
scription factors (1,2). However, some functions of the 
transcription apparatus are organism specific. For 
example, understanding the precise details of the tran- 
scription mechanism of a species and how they differ 
from those of others opens up options for targeted drug 
development against pathogens or formulation of 
pathway specific inhibitors (3). 

Bacterial transcription is essentially regulated by 
two classes of sigma (a) factors, a 70 whose eponymous 
member controls the transcription of housekeeping genes, 
and ct 54 for the expression of genes whose products coun- 
teract stress situations (4-7). Both classes of a-factors bind 
core RNAPs (E) to form holoenzymes (Ecr) that then bind 
to specific classes of promoters and initially form closed 
promoter complexes (RPcs) with a fully duplexed DNA 
template. Subsequently, transition into a transcriptionally 
competent open promoter complex (RPo) occurs. Ea 
holoenzymes undergo the required conformational 
changes spontaneously. Ea 54 , on the other hand, relies 
on the action of a bacterial enhancer binding protein 
(bEBP). These bEBPs use adenosine triphosphate (ATP) 
hydrolysis to remodel RPc for the formation of melted 
DNA in the RNAP active centre where the start site is avail- 
able in single-stranded form and templated RNA synthesis 
can occur (4,6,8-12). A pathway to RPo formation from 
the RPc has been proposed in which a bEBP-bound Ea 54 
complex, called RPi, is formed in which ATP hydrolysis 
is yet to occur (13). 

Multiple interactions are established between the 
RNAP and the a-factor, the a-factor and the DNA 
template/non-template strands and the RNAP and the 
DNA template/non-template strands (14-16). These inter- 
actions precisely position the +1 start site DNA in the 
active centre, and their making and breaking accommo- 
date the structural rearrangements, such as melting of the 
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DNA template and RNAP clamp closure. The clamp is 
formed by the p' subunit, and by closing over the active 
centre cleft it holds RNA and DNA in place during tran- 
scription (17,18). To achieve this, these interactions need 
to be dynamic and highly coordinated. 

We have previously described the contribution of the 
RNAP switch regions to the opening of the transcription 
start site by demonstrating that transcriptional defects 
caused by point mutations in the switch regions can be 
rescued by pre-opening the +1 site (19). A recent struc- 
tural study suggests that the non-template strand +1 site 
is contacted by residue (3W183 of the p subunit of 
Escherichia coli RNAP (Figure 1C). According to this 
model, which is derived from a cr 70 -dependent RNAP 
and promoter, the +2 G of the non-template strand fits 
into a pocket formed by the p subunit of RNAP and sub- 
sequently p W183 undergoes stacking interactions with the 
+ 1T. This arrangement is in agreement with previously 
reported cross-linking data (14,16). 

We considered that the start site opening mechanism for 
a 54 re gulated transcription that is controlled by the switch 
regions might ultimately converge on p\V183 in E.coli 
RNAP. Through strong stacking interactions with the 
+ 1 site, the pW183 residue could act to pull or capture 
the non-template strand away from the template strand 
potentially making its +1 site accessible to the incoming 
nucleotide. However, no biochemically detectable pheno- 
type has been reported for an alanine substitution mutant 
of p\V183 (16). The difficulties to detect a phenotype with 
PW183A may reflect a caveat of gulated transcrip- 
tion where the RPi states are short-lived and transient and 
therefore barely accessible to biochemical investigations. 
The a 54 -regulated system, on the other hand, with its re- 
quirement for ATP hydrolysis to progress from the RPi to 
the RPo, allows us to trap the Ea 4 -bEBP complex at 
a point just before ATP hydrolysis, which is believed to 
represent one state of the RPi. Hence, the cr 54 -regulated 
system may be much better suited to identify phenotypes 
that affect the RPi than is the gulated system. 

We anticipate that some similar interactions to those 
reported for a 70 are established in a <r 54 -dependent 
system, and, here, we provide evidence that for enhancer- 
dependent transcription PW183 keeps the transcribing 
activity of an intermediate complex (RPi) in check to 
prevent transcription from occurring prematurely during 
the RPc to RPi and to RPo interconversions. Finally, we 
propose that PW183 may have an additional role in 
mediating template forward translocation or DNA 
scrunching during transcription elongation. 

MATERIALS AND METHODS 

Mutagenesis 

Single amino acid substitutions of the E. coli rpoB gene 
(pIA545) were introduced by site-directed mutagenesis 
and reconstituted in pVSlO as previously described (19,20). 

DNA probes and proteins 

DNA templates representing the Sinorhizobium meliloti 
nif'H or the lacUV5 promoter were 5'- 32 P-labelled (where 



appropriate) and annealed to the complementary strand 
as described (22). Sequences of the promoter constructs 
are listed in Supplementary Table SI. Escherichia coli 
RNAP [wild-type (WT) and PW183 variants], PspFi_ 275 
and Klebsiella pneumoniae a 54 were overexpressed in E. 
coli and purified as described (9,23,24). Escherichia coli 
a 70 was a gift from S. Wigneshweraraj, PspF D108A 
was a gift from N. Joly and PspF F85A was a gift from 
N. Zhang. 

Eo -54 formation assays 

A total of 1 uM 32 P-HMK- a 54 (gift from N. Joly) were 
incubated with 25-100 nM RNAP (WT or switch variants) 
and 500 ug/ml a-lactalbumin in standard Tris-acetate 
(STA) buffer [25 mM tris-acetate (pH 8.0), 8mM magne- 
sium acetate, 10 mM KC1, 3.5% (w/v) PEG6000] at 37°C 
for 15 min. The reactions were quenched by adding native 
buffer, and the Ea 54 holoenzyme complexes formed were 
analysed on 4.5% non-denaturing gels. Products were 
visualized and quantified using a Fuji FLA-5000 
Phosphorlmager. For quantitation purposes, the PSL 
transfer scale was adjusted using the AIDA software, 
and for illustration purposes, the Curves scale was 
adjusted using Photoshop CS3. All Ea 54 formation 
assays were performed as titrations at three different 
RNAP concentrations (25 nM, 50 nM, 100 nM). The titra- 
tions were conducted to ensure that the observed effects 
were not distorted due to assay saturation effects. For 
each RNAP concentration, the activity was calculated 
relative to WT activity (in %). This provided us with trip- 
licate results for the relative activities of each variant, 
which were then averaged, and standard deviations were 
calculated. 

Minimal scaffold assays 

Specific promoter-independent transcription assays 
('minimal scaffold assays') were performed as described 
(24) using 100 nM of a minimal scaffold template that 
was constructed by first annealing the DNA strands 5-G 
GTCCTGTCTGAAATTGTTATCCGCTAC (template) 
and 5-ACAATTTCAGACAGGACC (non-template) to 
which a priming RNA molecule of either seven bases 
(5-GUAGCGG) or eight bases (5-GUAGCGGA) in 
length was annealed. These assays permit determination 
of the catalytic activity of the RNAP variants in the 
absence of any requirements for promoter recognition or 
interaction with a factors. The reactions were carried out 
in STA buffer. RNA synthesis was initiated by addition of 
25, 50 and lOOnM core RNAP (WT or W183 variants) 
and 1 uCi of ot 32 P-UTP. The reactions were incubated at 
37°C for 15 min, quenched by addition of loading buffer 
and analysed on 20% denaturing gels as described earlier 
in the text. 

Inorganic pyrophosphate assay 

In the post-extension state, RNAPs are capable of 
catalysing the cleavage of the terminal nucleotide from 
a transcript on addition of inorganic pyrophosphate 
(PPi). This activity is pyrophosphorolysis and is one 
of the proofreading mechanisms of RNAP (25). 
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Figure 1. RW183 is highly conserved in bacteria and acts in a complex network in the active centre. (A) ClustalW analysis of bacterial, archaeal and 
eukaryotic species illustrate a strong conservation of PW183 in bacteria, with no counterpart in the corresponding archaeal or eukaryotic RNAP 
subunits. (B) In a model of the TEC PW183 is in close proximity to the non-template strand near the active centre (21). (C) PW183 engages in a 
complex network between the RNAP, the cr-factor and the nucleic acids in the active centre and interacts with the non-template +1 site [adapted 
from (16)]. 
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Pyrophosphatase assays were performed as described (20) 
by using lOOnM of the minimal scaffold assembly con- 
taining the a 32 P-labelled 7nt RNA in the presence of 
0.5 mM pyrophosphate. Reactions were initiated on 
addition of 25 nM, 50 nM or 100 nM core RNAP, 
incubated at 37°C for lOmin and analysed on 20% 
denaturing gels. 

Binding assays 

Minimal scaffold and promoter binding assays were con- 
ducted as described (24,26,27). Native gel mobility shift 
assays, both on the minimal scaffold and on ni/'H 
promoter probes, were conducted in STA buffer in a 
total reaction volume of 10 ul containing 25, 50 and 
100 nM core RNAP or Ea 54 (reconstituted with a 54 as 
described earlier in the text) and 25 nM a 32 P-labelled 
probe, which was incubated for 15min at 37°C. RPos 
and RPi were formed as described earlier in the text. 
The reactions were analysed using a 4.5% native gel run 
at 100 V for 55 min. The gels were dried and protein-DNA 
complexes were analysed as described earlier in the text. 

Small-primed RNA assays 

Promoter-dependent small-primed RNA (spRNA) assays 
(a form of abortive initiation assay in which only a subset 
of ribonucleotide triphosphates [rNTPs] are present) were 
performed as described (26) in STA buffer, in a 10 -ul 
reaction volume containing 25-100 nM Ea 54 
(reconstituted using 40 uM of a and 25, 50 and 100 nM 
RNAP) and 0.5 mM dATP. ADP.A1F trapped complexes 
for RPi formation were formed as described (26) in STA 
buffer, in a 10 -ul reaction volume containing 25-100 nM 
Ea 54 (reconstituted using 40 uM of a and 25-100 nM 
RNAP), 2mM dADP and 2.5 mM NaF. The samples 
were initially incubated at 37°C for 15 min to form Ea 54 
complexes. Then, 25 nM double-stranded variations of the 
Sinorhizobium meliloti nifH promoter were added for 
10 min at 37°C. Open complex formation was initiated 
by adding 7uM PspF^vs for 10 min at 37°C. For 
trapped complexes (to make RPi), 0.2 mM A1C13 was 
added for 10 min at 37°C. Finally, synthesis of spRNA 
(UpGGG) was initiated by adding an elongation mix con- 
taining 200ug/ml heparin, 0.5 mM UpG and 1 uCi a 32 P- 
guanosine triphosphate (GTP). The reactions were 
incubated at 37°C for 20 min and quenched by addition 
of loading buffer. Portions of the samples were analysed 
on a 20% denaturing gel and visualized and quantified as 
described earlier in the text. 

RNA retention assay 

The spRNA assays were carried out as described earlier 
in the text, but the sample was split after the elongation 
step. One portion each was mixed with native or 
formamide loading dye for analysis under native or 
denaturing conditions, respectively. 

Molecular dynamics 

The transcription elongation complex (TEC) model struc- 
ture was analysed by fully atomistic conventional molecu- 
lar dynamics and by an enhanced sampling method 



[accelerated MD (aMD); (28)] using the AMBER12 simu- 
lation package. The fully parameterized and charge- 
neutralized model was initially equilibrated at 300 
Kelvin in a TIP3P water octahedral waterbox at 
constant pressure and subjected to 7.5 nanoseconds (ns) 
of graphics processing unit (GPU)-supported conven- 
tional molecular dynamic (isothermal-isobaric [constant 
Number, Pressure, Temperature] ensemble) in Single 
Precision Double Precision hybrid calculation mixed pre- 
cision mode (29). To explore conformational changes 
occurring for a longer period, aMD was performed with 
extra boosts to dihedral and whole potential energies, 
resulting in an enhancement of simulation efficiency by a 
factor of ~2000 (28). The diehedral and total energy 
boosts were calculated as described (28) based on the 
energy values observed after the initial 7.5 ns. 

The resulting trajectories were analysed and visualized 
using Visual Molecular Dynamics (30). 

RESULTS 

PW183 is highly conserved in bacteria, but not in 
archaeal or eukaryotic species 

Cross-linking and structural data using Thermus 
thermophilics RNAP and a 70 strongly suggest (3W183 
interacts with the +1 site on the non-template strand 
during transcription initiation (14,16). Sequence align- 
ments of bacterial RNAP p subunits with archaeal B or 
B" and eukaryotic Rpb2 sequences highlight a strict con- 
servation of this residue across all bacterial species 
examined (Figure 1A). However, no orthologs of p\V183 
were found in either the archaeal or the eukaryotic RNAP 
subunits. As cr-factors are unique to bacteria, the presence 
of this residue in bacteria without an obvious counterpart 
in the archaea or eukaryotes may reflect a mechanism that 
involves all or a subset of cr-factors at promoters. 
Structural analyses illustrate that p W183 is within close 
proximity of the non-template strand at a distance of 
~40 A from Metal A of the catalytic site, suggesting that 
its influence on the actual catalytic reaction is limited or 
non-existent. PW183 may engage with the non-template 
+1 site indicating a potential role in start site opening 
(Figure 1A and B). These structural data were used and 
extrapolated to explain the findings in our Ea 54 -dependent 
systems, which are illustrated below (Figure 1C). 

PW183 variants are catalytically active but defective in 
binding minimal scaffolds and holoenzyme formation 

Single residue substitutions were created at position 
pW183 to investigate the residue's effect on various steps 
in the transcription initiation process using the E. coli 
RNAP. An alanine substitution was created to remove 
the aromatic while preserving mild hydrophobic pro- 
perties, a serine substitution introduced hydrophilic 
properties and aspartate and arginine substitutions 
introduced negative or positively charged sidechains, re- 
spectively. We tested these mutants in a number of in vitro 
assays to assess their catalytic activity and their capacity 
to bind DNA and to form holoenzymes (Figure 2). 
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Figure 2. Catalytic and binding properties (A) pW183 mutants are defective in nucleic acid binding and holoenzyme formation in comparison with 
wild-type RNAP (wt). The catalytic defects of the variants can in part be explained by the defects in scaffold binding and holoenzyme formation. The 
assays were performed as titrations at three different RNAP concentrations (25 nM, 50nM, 100 nM). The activity for each mutant was calculated 
relative to the WT activity. The data represent average activities, and the error bars represent standard deviations. For each reaction, 33 nM of oc 32 P- 
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RNAP) or 17nM (lOOnM RNAP), respectively. (B) Schematic of the minimal scaffold assay measuring RNA elongation (minimal scaffold assay) 
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In minimal nucleic acid scaffold-directed transcription 
assays, an RNA primer was hybridized to a DNA scaffold 
and extended by a single radiolabelled nucleotide. 
PW183A and PW183S performed WT-like in these 
assays, whereas the activity of PW183D and (3W183R 
was reduced to 20-70%. These observations are consistent 
with those of others (16) and also with pyrophosphate 
catalyzed RNA cleavage assays where the terminal nucleo- 
tide of the RNA primer is cleaved on addition of PPi 
(Figure 2). The RNA extension and cleavage activities in 
PW183 variants establish that p\V183 is largely dispens- 
able for the catalytic function of RNAP. Stronger defect 
phenotypes were observed in nucleic acid binding and also 
in Ecj 54 formation assays where all four mutants showed 
only between 10 and 65% of WT activity (Figure 2). 
Again, the defects of PW183D and PW183R were more 
pronounced. The nucleic acid-binding defects can account 
for the apparent reduction in catalytic activity, suggesting 
that the mutants are fully catalytically active. The defects 
in scaffold and cr-factor binding likely reflect changes in 
the complex network of interactions between the scaffold, 



the RNAP and the cr-factor that PW183 has been 
postulated to participate in (16). 

The failure to properly engage with nucleic acids and 
a 54 is directly reflected in the mutants' modestly 
compromised ability to form RPcs that reached 60-95% 
of WT levels (Figure 3). With Ea 54 , it is possible to stably 
form an intermediate complex by the binding of its 
cognate activator to RPc to form RPi. The RPi can be 
trapped by adding the nucleotide analogue ADP-A1F, 
which mimics the conformation of the nucleotide just 
prior to ATP hydrolysis (26). Formation of the RPi with 
Ea 54 by adding PspF and ADP-A1F to the RPcs were not 
affected, and for all RNAP variants comparable fractions 
(40^5%) of the RPc shifted into the RPi (Figure 3). 
When allowing the activator-driven ATP hydrolysis 
reaction to proceed and RPos to form, the defects 
became more pronounced with the mutants forming 
20-60% RPos in comparison to WT (Figure 3). This sug- 
gested that, in addition to their defects in binding DNA 
and a 54 , the mutants are also impaired in their capacity to 
form RPos with the two effects being additive. A 50% 
reduction in the amount of mutant as compared with 
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Figure 3. Promoter-binding properties. PW183 mutants are mildly 
defective in RPc formation, which can be explained by their general 
defects in holoenzyme formation and DNA binding. Although RPi 
formation is not affected, the mutants display additive defects in RPo 
formation. For RPc and RPo, the bars reflect the amount of complex 
that was formed relative to WT. For RPi, the bars reflect the fraction 
of RPc that has been converted to RPi for each variant (also see 
Supplementary Figure SI). 



WT RPos was demonstrated by forming RPos, followed 
by adding heparin as a competitor and the removal of 
samples at regular time intervals to estimate remaining 
RPo levels. The stability of the complexes formed 
towards heparin did not seem to be affected by the muta- 
tions (Supplementary Figure SI A, right). The presence of 
the initiating dinucleotide UpG did not significantly sta- 
bilize the RPos, thus ruling out the possibility that the 
defects in RPo stability occur at the level of base pairing 
to the template strand (Supplementary Figure SI A, left). 

RPc formation for Eo 7 has been reported to be more 
efficient on certain truncated promoter probes reflecting 
the energetic costs of making interactions with the down- 
stream fork junction structure formed around +1 (31) 
(Supplementary Figure SIB, bottom). A similar effect 
was not observed with Ecr 54 , and its cognate nifH 
promoter probes truncated at —5/— 5, +2/+2 and +6/+6, 
presumably reflecting the lack of any extensive down- 
stream fork junction formation in the Ecr 54 RPcs. 
Formation of —5/— 5 RPcs was comparable with RPc for- 
mation on WT/WT templates, whereas +2/+2 and +6/+6 
templates formed only a third of the number of RPcs. This 
suggests that some energetically favourable interactions 
before RPo formation are made by Ecr 54 with sequences 
downstream of +1. RPc formation on all four templates 
was at least three times lower than on a (— 10— 1)/WT 
template, which contained a mismatch between the non- 
template and template strands from the —10 to the —1 site, 
mimicking a transcription bubble (Supplementary Table 
SI). Also, the signals obtained on templates other than the 
(— 10— 1)/WT template were weak. Although the mutants 



apparently recovered their ability to form RPcs on these 
templates in comparison with WT, the overall weak signal 
intensity does not permit unambiguous quantitative con- 
clusions about such a recovery effect (Supplementary 
Figure SIB, top). Nevertheless, it appears that promoter 
DNA sequences downstream of the start site do impact on 
the observable phenotypes of (3W183 variants in promoter 
binding assays. These experiments also demonstrate that 
in the Ecr 54 -dependent transcription system, the presence 
of the downstream DNA is absolutely essential for the 
formation of functional RPos. 

Start site opening does not rescue mutant activity in 
promoter directed transcription 

We next tested Ea 54 promoter-directed transcription 
activity by carrying out spRNA transcription assays 
using a linearized nifH promoter probe with a mismatch 
from —10 to —1 [(— 10— 1)/WT], thus creating an artificial 
transcription bubble. After formation of RPcs and 
addition of the activator protein PspF and ATP, a di- 
nucleotide RNA primer (UpG, —1+1) was extended by 
one or two labelled nucleotides to form an abortive 
trimeric or tetrameric product (26). In these assays, 
(3W183A and PW183S performed WT-like, whereas the 
activity of pW183D and (3W183R was reduced to -40% 
(Figure 4). These results reiterate and reinforce the 
promoter binding observations described earlier in the 
text (Figures 2 and 3). The more severe defects of 
the W183D and W183R mutants are probably due to 
the greater structural impact on the introduction of side 
chain charges, whereas the W183S and W183A, which 
maintain the neutral charge of the tryptophan cause less 
severe structural disruptions. To investigate whether the 
observed changes were due to altered affinities of the 
RNAP variants to GTP, we performed spRNA assays 
under high nucleotide concentration conditions 
(Supplementary Figure S3). On addition of higher concen- 
trations of GTP, we observed a marked increase in the 
production of abortive transcripts. This increase was ini- 
tially greater for the WT than the mutants. All five 
variants reached saturation at 50 uM cold GTP, but al- 
though the activity of the W183 mutants remained stable 
or even slightly increased at higher concentrations, the 
activity of the WT declined. This inhibition effect was 
probably due to inhibition caused by GTP competing 
with UpG for the start site (Supplementary Figure S2). 

Because of the reported interaction of PW183 with the 
+ 1 site (16), we reasoned that a possible contribution to 
start site opening seemed likely. Previously, we 
demonstrated a role of the switch regions in start site 
opening by extending the artificial transcription bubble 
of our (— 10— 1)/WT template to include the +1 and +2 
sites (32). Using such templates in assays with switch 
region point mutants led to a striking recovery from 
their defects in promoter-directed transcription. We 
concluded that the defects of these mutants are due to a 
failure in start site opening, which can be compensated for 
by bypassing +1 site opening (32). 

No such recovery effect was observed with the (3W183 
mutants that performed in exactly the same manner as on 
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the standard (-10-1)/WT template (Figure 4). On fully 
duplexed templates where only the start site and its 
flanking residues were melted, we observed more 
dramatic defects, and the mutants reached 10-75% of 
WT activity. This is consistent with the previously docu- 
mented greater difficulties of using duplexed templates for 
making RPo (32) (Table 1). 

The WT nifH promoter harbors a purine residue — G — 
at the non-template strand +1 site. In our templates with 
the pre-melted start site, we had substituted G by T to 
create the mismatch. To assess the requirements for a 
specific type of base in the +1 position, we generated add- 
itional templates with A or C in the non-template strand 
+ 1 site (Figure 4). Again, we observed similar phenotypes 
as on the (-10-1)/WT template with (3W183A and 
PW183S performing WT-like and (3W183D and pW183R 
being reduced to 50-90% (Figure 4). Finally, we restored 
the non-template strand +1G and maintained the 



Table 1. Comparison of the activity of WT RNAP on promoter tem- 
plates used relative to its activity on the (— 10— 1)/WT template 



Promoter template Activity of WT RNAP relative to 

activity on (-10-1)/WT template (%) 



WT/WT 


5 ± 1 


(-10+ 1)/WT 


303 ± 99 


(-10 + 2)/WT 


280 ± 73 


(-10+ 1A)/WT 


363 ± 143 


(-10+ 1Q/WT 


233 ± 22 


(-10+ 1G)/+1T 


61 ± 16 


(-1)/WT 


64 ± 13 


(-1+11/WT 


182 ± 33 


(-l+2)/WT 


181 ± 29 


Numbers represent averages of three samples ± standard deviation. 
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mismatch by substituting the template +1C by T. Here, 
the (3W183A and (3W183S mutants had higher activity 
than the WT (150-180%), whereas the (3W183D and 
pW183R mutants remained suppressed at 50-60% 
(Figure 4). These experiments argued against PW183 
playing a sole role in start site opening. It is possible 
that the 150-180% increase with W183A and W183S 
and decrease to 50-60% with W183D and W183R on 
(— 10+1G/1T) substrate is due to W183 making inter- 
actions with the incoming dinucleotide. Substitution of 
W183 with alanine and serine may change the RNAPs 
preference for binding the UpA over the UpG dinucleo- 
tide. Also, the (— 10+1G)/1T template was less efficiently 
used (~60% efficiency in comparison with (— 10— 1)/WT) 
than all the other templates with a pre-melted template 
[200^100% efficiency in comparison with (-10-1)/WT] 
(Table 1). 

We considered that the differences we observed in the 
performance of the mutants were due to a delayed tran- 
script release. To investigate this possibility, we performed 
an RNA retention assay on both the (— 10— 1)/WT and the 
(— 10+1A)/WT template: The spRNA assay was carried 
out as usual but after the elongation step, the reaction 
was split and mixed with either native buffer for separ- 
ation of the reaction products on a native gel or with 
denaturing loading dye to analyse the total transcript on 
a denaturing gel. Under native conditions, we observed a 
high-molecular weight band representing Ect 54 , in prin- 
ciple with either a P-GTP or UpGGG* bound to it. 
This complex was more abundant with the W183 
mutants, in particular (3W183A and PW183S 
(Supplementary Figure S3A). This was true for both tem- 
plates with absolute amounts of the labelled species (i.e. 
bound plus released product) mirroring the efficiency with 
which the respective templates were used (Table 1). The 
proportion of the retained species was accounted for a 
maximum of 0.3% of the total transcript. For this 
reason, differences between the W183 mutants and the 
WT were extremely difficult to score. It was not possible 
to establish with certainty whether the retained species 
represents bound 32 P-GTP or UpGGG*. In conjunction 
with results showing that the apparent affinity of the 
mutants for the nucleotide is lower than that of the WT, 
the retained species is likely to be UpGGG*. Although the 
release of the RNA transcript may not in itself be a major 
contributing factor to the reduced performance of 
PW183D and pW183R (Supplementary Figure S3A), its 
increased retention may indicate a change in product 
binding. 

PW183 limits the transcriptional activity of an RPi 

Although RPos must be formed for optimal transcription 
initiation, production of short transcripts occurs from one 
intermediate complex, RPi, formed just before ATP hy- 
drolysis (26). This activity is weak and presumably must 
be suppressed until a stable RPo is formed, which can enter 
the elongation phase and — by implication — productive 
transcription. The RPi is a transient state that is largely 
inaccessible for experimental investigation in Ecr 70 - 
mediated transcription. In contrast, the RPi created when 



Ect -RPc is bound by PspF can be trapped with ADP-A1F 
(26). Trapping can be exploited to explore the transcrip- 
tional activity of the RPi. The key position of pW183 in a 
structural network defining RPo is in stark contrast to its 
failure in producing strong mutant phenotypes in any state 
(RPc or RPo) that can be studied in a CT 70 -regulated system. 
We therefore reasoned that PW183 may predominantly 
function in the RPi state. To test this hypothesis, WT 
and mutant Ect 54 complexes were trapped as RPi (Figure 
5A) by adding PspF and ADP-A1F before performing 
spRNA reactions. Here, the PW183A and PW183S 
mutants displayed superactivity by reaching levels, which 
are up to twice as high as those reached by WT RNAP. 
PW183D and PW183R perform at 75% of WT activity 
(Figure 5B and C), a significantly higher activity than 
under ATP hydrolysis conditions where RPo forms 
(Figure 5A, B and D). These data thus provide clear 
evidence that PW183 functions to limit the transcribing 
activity of the RPi. 

To test the stringency of the requirements for inter- 
actions between the non-template strand and pW183, the 
mismatch of the non-template strand was extended by 1 , 2 
or 3 bases [(-10-1)+1/WT, (-10-l)+2/WT, (— 10— 1)+3/ 
WT]. We also tested a template where the non-template 
strand was shortened by one base [(—10—1) — 1/WT) and 
another template where the entire —10—1 sequence of 
the non-template strand was deleted [A(— 10— 1)/WT] 
(Figure 5B). The rationale of creating these templates 
was to provide alternative interfaces for interactions of 
the non-template strand with the RNAP, thereby 
mimicking different stages during the structural rearrange- 
ments, such as DNA scrunching (33,34). We found an 
increase in mutant activity on almost all these templates, 
which in the case of PW183A and PW183S consistently 
exceeded WT activity (Figure 5B and D). The effect was 
weaker with PW183D and pW183R where superactivity 
was observed only with the (— 10— l)+3/WT template 
(Figure 5B and D). Under trapping conditions used for 
forming RPi, the superactivity effect occurred to the 
same or an even larger extent with the mutants being up 
to 5-fold more active than the WT (Figure 5B and D). 
Similarly, we found that different forms of the RPis 
created by use of several PspF variants altered in their nu- 
cleotide hydrolysis and RPc binding activities also became 
superactive (Supplementary Figure S3B). This 
demonstrated that premature transcription occurs in dif- 
ferent forms of the RPi when PW183 is absent. 

PW183 may be involved in forward translocation or 
DNA scrunching 

Simulations of the molecular dynamics (MD) of PW183 
within the context of a transcription elongating complex 
illustrate a stable interaction of the tryptophan with the 
non-template strand (Figure 6A, Supplementary Movie 
SI). The aMD shows that this interaction can be 
broken, allowing pW183 to dissociate from the non- 
template strand (Figure 6B, Supplementary Movie S2) 
and in a next step engage with PR197 (Figure 6C, 
Supplementary Movie S3). This breaks pR197's contacts 
with the phosphodiester backbone and permits 
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Figure 5. PW183 functions in limiting transcription activity of the RPi. (A) Generating labelled tetramers under trapping (RPi) and ATP hydrolysis 
(RPo) conditions. (B) Template variants used for spRNA assays to assess the effect of non-template strand length. (C) Under trapping conditions, 
the mutants are significantly more active in using templates with different non-template strand length. Assays were carried out in triplicates as 
titrations with 25 nM, 50 nM and lOOnM RNAP. Signal intensity was quantified using the AIDA imaging software and normalized against the total 
signal per lane. Error bars represent standard deviations. (D) The same effect, albeit slightly weaker, is observed under ATP hydrolysis conditions 
(also see Supplementary Figure S3). 



reengagement of PW183 and (3R197 with DNA bases 
further downstream after translocation. The cycle of inter- 
action, dissociation, engagement with R197 and re-en- 
gagement with the DNA is accompanied by structural 
rearrangements in the DNA template. The two residues 
thus seem to be involved in a mechanism that propels the 
DNA template forward and could therefore contribute to 
either the forward translocation mechanism of the RNAP 
or DNA scrunching (Supplementary Movie SI 3). 



DISCUSSION 

A thermodynamic sink drives RPo formation and 
template translocation 

The nucleation of the transcription bubble, completion of 
the transcription bubble and RNAP clamp closure are 
all integral steps en route to the formation of an RPo. 



They are governed by conformational changes of key 
RNAP domains, can be orchestrated through the action 
of sigma factors and their promoter DNA binding inter- 
actions (15) and are therefore subject to genetic regulation. 
Our interest in the mechanism of start site opening in ac- 
tivator-dependent RPo formation had previously revealed 
a major and distinctive contribution of the RNAP switch 
regions (32). (3W183, being a large aromatic residue and 
positioned in close proximity to the non-template +1 site, 
made it a likely candidate to engage in interactions with 
the +1 base (14,16). As such, a role in providing an inter- 
face to compete with the template strand base-pairing 
interactions seemed plausible; (3W183 could act as a 
lever to open up the start site and make it accessible to 
incoming nucleotides. Such a scenario would tie in with a 
relay mechanism that is initiated by the switch regions and 
finally converges onto pW183. The contribution of the 
switch regions to start site opening had been demonstrated 




Figure 6. Molecular dynamics simulation of a TEC shows a cycle of 
engagement of (3W183 (orange) with the non-template strand (green), 
followed by its dissociation and subsequent interaction with PR197 
(blue). The template strand is shown in silver. (A) (5W183 stably inter- 
acts with flipped out non-template strand bases (cyan and purple), 
PR197 interacts with the phosphodiester backbone of the template 
strand. (B) (3W183 dissociates from the non-template strand. 
(C) pW183 engages with PR197 (also see Supplementary Movies SI 
and S2). 
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by using promoter templates with a preopened +1 site, 
which partially rescued the transcription defects of 
switch region point mutations (32). No such recovery 
effect was observed with the PW183 variants, thus 
clearly arguing against a singular role in start site 
opening. Two of the mutants, PW183D and (3W183R, dis- 
played mildly suppressed phenotypes, whereas (3W183A 
and PW183S performed WT-like on most of the templates 
(Figure 4). To exclude any sequence-specific bias on a start 
site opening role of W 183, all four bases were tested in the 
non- template +1 site, but only the non-template +1G in 
combination with a template +1T resulted in a gain-of- 
function phenotype for PW183A and PW183S. Although 
this may suggest a sequence-specific requirement for the 
non-template strand +1 site, the uncertainties caused by 
the additional changes that were required to maintain the 
mismatch do not allow this effect to be unambiguously 
attributed to the non- template strand +1/W183 inter- 
action. We therefore conclude that no strong evidence 
exists for PW183 to actively support start site opening as 
its sole function. 

The aMD studies of PW183 in a transcription elong- 
ation complex, on the other hand, revealed that pW183 
may undergo a cycle of making and breaking interactions 
with flipped out bases of the non-template strand (state A) 
but can also engage with another conserved arginine 
(R197, state B) residue in the beta subunit, which alterna- 
tively interacts with the phosphodiester backbone in state 
A. By creating strong alternating reaction biases towards 
either state A or state B, these interactions may act in the 
fashion of a molecular toggle, which contributes to 
template translocation and/or DNA scrunching. A 
similar role has been proposed for a pocket formed by 
PR151, PI445, PD446, PR451, PL538 and pV547, which 
accommodates the non-template +2 site in the initiation 
complex (16). This pocket finds its structural counterpart 
in the eukaryotic RNAPII system (35) and may represent 
a generic mechanism that recurs during elongation to fa- 
cilitate translocation (36). We propose that the inter- 
actions between PW183 and the +1 base of the DNA 
non-template strand act as a thermodynamic sink 
(Figure 7): In the RPc and RPi, base-pairing interactions 
between the template and non-template +1 site are 
thermodynamically stable (Figure 7 A) but are disrupted 
on establishment of the interaction network within the 
active centre. The equilibrium shift allows the non- 
template + 1 base to flip out (Figure 7B). To fully drive 
the reaction towards the RPo, the now accessible base 
interacts with p\V183 via interactions, which are thermo- 
dynamically favored (Figure 7C). On reaching a thermo- 
dynamically stable state, interactions between other 
RNAP residues and the DNA must be broken to alter 
the thermodynamically stable state of the PW183/+1 site 
interaction and to result in its dissociation. PW183 subse- 
quently interacts with R197 to release it from its inter- 
action with the phosphodiester backbone. Once both 
interactions are broken, the template can slide forward 
to reach another stable state by PW183 interacting 
with another flipped out base further downstream 
(Supplementary Movie S3). 
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dynamic bias towards a stable RPo (also see Supplementary Movie S3). 



PW183 limits RPi activity and prevents the formation of 
bacterial nano-RNAs 

The stabilization of abortive transcripts and their exten- 
sion occurs on RPo formation (37). The bacterial 
enhancer-dependent transcription system allows us to 
capture and study the intermediate states (RPi) en route 
from the RPc to the RPo by trapping the complex prior to 
ATP hydrolysis (26,38). This reveals phenotypes that may 
escape studies performed in the a 70 -dependent 



transcription system where the RPi state is independent 
of activator binding and ATP hydrolysis (5) and thus 
cannot be readily trapped, stabilized and isolated for 
study. Performing trapping reactions for RPi formation 
with the PW183 mutants demonstrated that the amount 
of RPi formed was directly proportional to the amount of 
corresponding RPc, and that this proportion was constant 
across all the mutants and the WT (Figure 3). However, 
transcription activity reactions performed from such 
trapped complexes revealed strikingly superactive pheno- 
types of the mutants, most notably the pW183A and 
PW183S mutants (Figure 5C). Clearly, in the absence of 
the aromatic residue, constraints imposed on the RPi 
activity are lifted allowing extra transcription to occur 
prematurely. We observed a similar effect under ATP hy- 
drolysis conditions with various non-template strand 
lengths (Figure 5D). The different lengths of the non- 
template strand might hinder the formation of the 
RNAP-non-template strand interactions that are 
required to progress into the open complex, and instead 
stall the complex in an RPi like state by mimicking differ- 
ent stages of DNA scrunching and/or promoter DNA 
engagement by RNAP (33,34). Burst and fast permangan- 
ate footprinting experiments showed that the RPo differs 
from the RPi by the position of the non-template strand in 
the active centre, allowing the assembly and DNA binding 
of the clamp and jaw domains (39). Transcription 
occurring in the RPi may limit the conversion to RPo 
by restricting necessary further conformational change in 
RNAP. Recent Forster resonance energy transfer analyses 
have demonstrated substantial flexibility of the transcrip- 
tion bubble in the active centre due to scrunching and 
unscrunching of the DNA template during the early 
stages of transcription initiation. Expansion and contrac- 
tion of the transcription bubble promote the utilization of 
alternative upstream or downstream start sites (40). It 
seems plausible that variability in start site selection may 
result in the production of alternative transcripts. We 
argue that PW183 may be the key in preventing the for- 
mation of such alternative transcripts by limiting the tran- 
scriptional activity of the RPi during the various stages of 
DNA scrunching. Also, biasing towards RPi-dependent 
transcription by providing promoters with variable non- 
template strand lengths allowed the PW183 mutants to 
produce a larger number of short abortive RNAs in RPi 
(Figure 5C), which — along with intermediates of RNA 
degradation (41,42), products of RNA cleavage during 
elongation (43,44) and products of cyclic dinculeotide deg- 
radation (45-48) are one source of bacterial nano-RNAs 
(49-51). Such transcripts with a length of 2 to ~5nt can 
significantly alter the intracellular gene expression profile 
by (i) competing with nucleotides as primers for transcrip- 
tion initiations and extending the 5' end of the resulting 
transcripts; (ii) changing the position where transcription 
begins (52); and (hi) affecting the phosphorylation state — 
and accordingly the stability of a transcript — by changing 
the 5' end (48,53). We propose that under normal condi- 
tions, PW183 keeps RPi activity in check to prevent such 
sporadic and uncontrolled accumulation of bacterial 
nano-RNAs. 



Nucleic Acids Research, 2013, Vol. 41, No. 11 5885 



CONCLUSION 

We have revealed two critical functions of residue (3W183, 
(i) in limiting the transcription activity of the RPi and 
thus preventing the formation of potentially detrimental 
bacterial nano-RNAs or alternative transcripts and (ii) in 
acting in a thermodynamic sink mechanism that creates a 
strong reaction bias towards a fully formed RPo or 
towards a translocated TEC. The thermodynamic 
context that PW183 functions in is likely to play a key 
role in fuelling the formation and disruption of inter- 
actions in a complex matrix that in their entirety drive 
the transcription process. 
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